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OM protein - protein search, using sw model 
Run on: January 25, 2 0 04, 17:08:19 



Title: 



; Search time 2 0 Seconds 

(without alignments) 

1759.887 Million cell updates/sec 



US-10-060-848-3 
Perfect score: 1942 

Sequence: 1 MNYSKDAPEFWSPKDAREF ALEELATKCDVQMFSYKRLK 366 

Scoring table: 



BLOSUM62 
Gapop 10.0 , Gapext 0.5 



Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 



283308 



Minimum DB seq length: 
Maximum DB seq length: 



2000000000 



Post-processing : 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

PIR 76:* 



pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total *score distribution. 
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ALIGNMENTS 



RESULT 1 
T20396 

hypothetical protein F36A2.3 - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 21-Jan-2000 
C;Accession: T20396; T21836 
R/Lennard, N. 

submitted to the EMBL Data Library, June 1997 
A; Reference number: Z19266 
A; Accession: T2 03 96 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A;Residues: 1-372 <WIL> 

A;Cross-references : EMBL:Z96047; PIDN: CAB09417 . 1; GSPDB : GN00019 ; CESP:F36A2.3 
A; Experimental source: clone DY3 
R/Lennard, N. 

submitted to the EMBL Data Library, October 1996 
A/Reference number: Z19476 
A; Access ion: T2183 6 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 



A;Residues: 1-372 <WI2> 

A;Cross-references: EMBL:Z81077; PIDN:CAB03073 . 1; GSPDB : GN00019 ; CESP:F36A2.3 

A; Experimental source: clone F36A2 

C; Genetics : 

A;Gene: CESP:F36A2.3 

A /Map position: 1 

A;Introns: 21/3; 53/2; 79/1; 114/3; 147/3; 198/3; 249/1; 284/3; 353/3 
C; Super family: malate dehydrogenase ylbC 



Query Match 51.9%; Score 1008; DB 2; 

Best Local Similarity 54.6%; Pred. No. 1.2e-72; 
Matches 194; Conservative 48; Mismatches 111; 



Length 3 72; 
Indels 2; 



Gaps 



2; 



Qy 

Db 

Qy 

Db 



6 DAPEFWSPKDAREFWKCMQTVGTSPDHAGQLADLLLDADLVGHYSHGLNRLHIYVDD- 64 

: I I : I : I I = = I = I = III llh II =1 I I I I I h I I I I I I I I 

11 ETDEIVISKEKLDSFVLECLAKAGCTGDHAQQLAETLLCSDYRGHYSHGINRLHIYVHDL 70 

65 VKNGVKGNGVPKVLKQKGGTAWVDGENLLGAWGNFCTDLAIKLAKEFGVAWWTKNSNH 124 

I hill 1 1 IMIII 1 1 1 1 IMIII II- Mllh III Mill 

71 MMKSTAVTGTPQVLKSKGSTAWVDGNNLLGPWGNFCMQLAVEKAKEFGIGWWCRNSNH 13 0 



Qy 



Db 



12 5 YGACQHYTKKIANAGMVGMSFTNTSPLMFPCRSSEIGLGTNPLSCCVNSEKTGDSFLLDM 184 

M I hllhllMM Ml I I Ihlh I MM Ml 

131 FGIAGWYADFACRNGLVGMAFTNTSPCVFPTGSREKSLGSNPI -CMAAPGMEGDSFFLDM 189 



Qy 

Db 

Qy 
Db 

Qy 
Db 



185 ATTTVALGKVELADCRGKTQIPSTWGADSKGNPSTDTQWLHGGGLLPLGGIEETGSYKG 244 

IMIII Ihh I MM II MMI I : : : : 1 1 MM MM I M Ml 
190 ASTTVAYGKIEWDRKGETYIPGSWGADKNGDETHNPKEVLDGGGLQPLGGSEITGGYKG 249 

245 TGLSMMGELFCGILAGSSFGKNVRLWGQSHKAADNGQCFVAIDQECFAPGFAPRLQQFLD 304 

III II h llh Ihlllhl I : I II llllllll IMMIh MIM I 
250 TGLCMMVEVLCGIMGGSAFGKNIRQWQTTSKTADLGQCFVAIDPECFAPGFSNRLQEFCD 309 

3 05 ETRNLKPISEEKPVLVPGDPERMNTEYSQKAGGLVYQEGQIKALEELATKCDVQM 359 

MIM Ih M IMIMI : MMI- h I: M : I I 

310 ETRNLNPINPSRPPQVPGDPERAHMNMCDDLGGIVYKKKQLDHLKNLADRLGVIM 364 



RESULT 2 
T18570 

probable malate dehydrogenase (EC 1.1.1.37) - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 18-Feb-2000 
C;Accession: T18570; T20833 
R;Ains cough, R. 

submitted to the EMBL Data Library, December 1998 
A;Reference number: Z18979 
A; Accession: T1857 0 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A;Residues: 1-400 <WIL> 

A;Cross-references : EMBL : AL033535 ; PIDN: CAA22133 . 1 ; CESP : VF13D12L . 3 
A; Experimental source: clone VF13D12L 
R;Coles 7 L. 

submitted to the EMBL Data Library, April 1995 
A;Reference number: Z19330 
A; Accession: T2 0833 



A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A/Residues: 1-400 <WI2> 

A; Cross-references: EMBL : Z4 9127 ; PIDN : CAA88951 . 1 ; GSPDB : GN00020 ; CESP : VF13D12L . 3 
A; Experimental source: clone F13D12 
C; Genetics : 

A;Gene: CESP : VF13D12L . 3 
A; Map position: 2 

A;Introns: 53/3; 96/1; 179/3; 230/3; 276/1; 315/3; 347/3; 384/3 
C;Superf amily : malate dehydrogenase ylbC 
C ; Keywords : oxidoreductase 

Query Match 47.6%; Score 924; DB 2; Length 40 0; 

Best Local Similarity 53.8%; Pred. No. 6.5e-66; 

Matches 191; Conservative 48; Mismatches 110; Indels 6; Gaps 5; 

Qy 7 AP-EFWSPKDAREFWKCMQTVGTSPDHAGQLADLLLDADLVGHYSHGLNRLHIYVDDV 65 

llllh : : hi HI II : I I I I I : I I : I : I I I I I I I I I I = I I I = 
Db 43 APEESWAKDEMKRFMVECMTKVGATESHATQLALVLLEGDIRGHYSHGLNRLDMYVRDI 102 

Qy 66 KNGV-KGNGVPKA/LKQKGGTAWVDGENLLGAWGNFCTDLAIKLAKEFGVAWVVTKNSNH 124 

= I Ihl I =11 = 1 lllllll 1 1 1 1 llllll 1111= II h III I III 

Db 103 EQNVCKGDGEPIILKEKAGTAWVDGNNLLGPWGNFCMDLAIEKAKNAGIGWWAKGSNH 162 

Qy 125 YGACQHYTKKIANAGMVGMSFTNTSPLMFPCRSSEIGLGTNPLSCCVNSEKTG-DSFLLD 183 

II I : Ihlll llllh II Ih llllhl : = II Ihl 

Db 163 YGIAGWYALRAMKKGMLGMSMTNTSPISFPTRSAVPALGTNPIS - -LAAPGTGDDSFVLD 220 

Qy 184 MATTTVALGKVELADCRGKTQIPSTWGADSKGNPSTDTQWLHGGGLLPLGGIEETGSYK 24 3 

Ihl I I hi I I Ml I : :| =M I Ml I h I I I I II M h I M I I 

Db 221 MASTTVAIGKVELA-ARKENPVPLSWGVGEGGKETTDPTKVLYGGGLLPLGGVEVSGGYK 27 9 

Qy 244 GTGLSMMGELFCGILAGSSFGKNVRLWGQSHKAADNGQCFVAIDQECFAPGFAPRLQQFL 3 03 

I III I hllMMh M Ml I ; II Mill I Mill III h 

Db 280 GYGLSSMIEIFCGILAGAHWGPHVRKWMSTKSEADLGQCFVAIDPEAFAPGFADRLQDFM 33 9 

Qy 3 04 DETRNLKPISEEKPVLVPGDPERMNTEYSQKAGGLVYQEGQIKALEELATKCDVQ 3 58 

II I 111111= == Ih I = M = =11 I 1 = 

Db 340 QTMRALPTSSPSFKVEVAGDMERRHEALVEQLGGIPYHKNQITFVNDLAAKLGVK 394 



RESULT 3 
B71073 

probable malate dehydrogenase - Pyrococcus horikoshii 
C; Species: Pyrococcus horikoshii 

C;Date: 14-Aug-1998 #sequence_revision 14-Aug~1998 #text_change 20-Jun-2000 
C; Accession: B71073 

R; Kawarabayasi , Y. ; Sawada, M.; Horikawa, H. ; Haikawa, Y.; Hino, Y. ; Yamamoto, 
S.; Sekine, M. ; Baba, S.; Kosugi, H. ; Hosoyama, A.; Nagai, Y. ; Sakai, M. ; Ogura, 
K. ; Otsuka, R. ; Nakazawa, H.; Takamiya, M.; Ohfuku, Y. ; Funahashi, T. ; Tanaka, 
T.; Kudoh, Y.; Yamazaki , J.; Kushida, N. ; Oguchi , A.; Aoki, K. ; Yoshizawa, T. ; 
Nakamura, Y.; Robb, F.T.; Horikoshi, K. ; Masuchi, Y.; Shizuya, H. ; Kikuchi, H. 
DNA Res. 5, 55-76, 1998 

A; Title: Complete sequence and gene organization of the genome of a hyper- 
thermophilic archaebacterium, Pyrococcus horikoshii OT3 . 
A;Reference number: A71000; MUID : 98344137 ; PMID:9679194 
A; Accession: B7107 3 



A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A;Residues: 1-360 <KAW> 

A; Cross-references: GB:AP000005; NID : g3236132 ; PIDN : BAA303 80 . 1 ; PID:g3257697 
A; Experimental source: strain OT3 

A;Note: this accession replaces an interim accession for a sequence replaced by 

GenBank 

C; Genetics : 

A;Gene: PH1277 

C; Superf amily : malate dehydrogenase ylbC 

Query Match 26.3%; Score 511; DB 2; Length 360; 

Best Local Similarity 36.9%; Pred. No. 4.6e-33; 

Matches 12 8; Conservative 66; Mismatches 13 9; Indels 14; Gaps 8; 

Qy 10 FWSPKDAR - EFWKCMQTVGTSPDHAGQLADLLLDADLVGHYSHGLNRLHI YVDD - VKN 67 

:: III hh : :| =1 : I I I = I I I I . I I I = I I III = 

Db 11 YIRVPKDRLFSFIVRVLTKLGVPEEDAKIVADNLVT4ADLRGVESHGVQRLKRYVDGIISG 70 

Qy 68 GVKGNGVPKVLKQKGGTAWVDGENLLGAWGNFCTDLAIKLAKEFGVAWWTKNSNHYGA 127 

II : :|::: I =11= II III INI Ih h h =111111 

Db 71 GVNLHPKIRVIREGPSYALIDGDEGLGQWGYRSMKLAIKKAKDTGIGIVIARNSNHYGI 13 0 

Qy 12 8 CQHYTKKIANAGMVGMSFTNTSPLMFPCRSSEIGLGTNPLSCCVNSEKTGDSFLLDMATT 187 

= 1 I 11 = 1 = 1 Ih Ih I I 11111 = : =: I I II II h 

Db 131 AGYYALMAAEEGMIGISMTNSRPLVAPTGGIERILGTNPIALAAPTK- -DKPFLLDMATS 188 

Qy 188 TVALGKVELADCRGKTQIPSTWGADSKGNPSTDTQWLHGGGLLPLGGIEE-TGSYKGTG 246 

I : I I = I : =11 II I = =11 =1 = I =11 MINI I I =11 I 
Db 189 WPIGKLEVYRRKGK-DIPEGWAINREGNITTKVEEVFNGGALLPLGGFGELLGGHKGYG 247 

Qy 247 LSMMGELFCGILAGSSFGKNVRLWGQSHKAADNGQCFVAIDQECFAP - -GFAPRLQQFLD 3 04 

Ihl == llhl =: I h II- h II I I I I == | = = 

Db 248 LSLMVDILSGILSGGTWSKYVK- -NTSEKGSNVCHFFMVIDIEHFIPLEEFKEKISQMIE 305 

Qy 305 ETRNLKPISEEKPVLVPGDPERMNTEYSQKAGGLVYQEGQIKALEEL 351 

| :: : | : : : |: : | | | :|: | |||| 

Db 306 EIKSSRKHPEFERIWIHGEKGFLTMETRLKLGIPIYR KVLEEL 348 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: January 25, 2004, 17:06:33 ; Search time 18 Seconds 

(without alignments) 
956.210 Million cell updates/sec 



Title: US -10 -060-848-3 

Perfect score: 1942 

Sequence: 1 MNYSKDAPEFWSPKDAREF . . ALEELATKCDVQMFSYKRLK 366 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 127863 seqs, 47026705 residues 

Total number of hits satisfying chosen parameters : 127863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SwissProt_41 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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No. Score Match Length DB ID Description 
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ALIGNMENTS 



RESULT 1 
MDH_PYRH0 

ID MDH_PYRH0 STANDARD; PRT; 36 0 AA. 

AC 059028; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Malate dehydrogenase (EC 1.1.1.37). 

GN MDH OR PH1277. 

OS Pyrococcus horikoshii. 

OC Archaea; Euryarchaeota; Thermococci; Thermococcales ; Thermococcaceae ; 

OC Pyrococcus . 

OX NCB I_Tax ID= 5 3 953; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=0T3 ; 

RX MEDLINE=98344137; PubMed=967 9194 ; 

RA Kawarabayasi Y. , Sawada M. , Horikawa H. , Haikawa Y., Hino Y. , 

RA Yamamoto S., Sekine M. , Baba S.-I., Kosugi H. , Hosoyama A., Nagai Y. , 

RA Sakai M. , Ogura K. , Otsuka R. , Nakazawa H. , Takamiya M. , Ohfuku Y. , 

RA Funahashi T. , Tanaka T. , Kudoh Y. , Yamazaki J., Kushida N. , Oguchi A., 

RA Aoki K.-I., Yoshizawa T., Nakamura Y. , Robb F.T., Horikoshi K. , 

RA Masuchi Y. , Shizuya H. , Kikuchi H. ; 

RT "Complete sequence and gene organization of the genome of a hyper- 

RT thermophilic archaebacterium, Pyrococcus horikoshii 0T3 . " ; 



RL DNA Res . 5:55-76(1998) 



- CATALYTIC ACTIVITY: (S) -malate + NAD ( +) = oxaloacetate + NADH. 

- SUBUNIT: Homodimer (By similarity) . 

- SUBCELLULAR LOCATION: Cytoplasmic (Probable) . 

- SIMILARITY: BELONGS TO THE LDH2/MDH2 OXDIDOREDUCTASE FAMILY . 



CC 

cc 

CC 
CC 

CC -- ~- - 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AP000005; BAA30380.1; 

DR PIR; B71073; B71073 . 

DR InterPro; IPR003767; ldh_2 . 

DR Pfam; PF02 615; ldh_2 ; 1. 

KW Oxidoreductase; Tricarboxylic acid cycle; NAD; Complete proteome . 

SQ SEQUENCE 360 AA; 39751 MW; 6E9D8B16ECDE6E6F CRC64 ; 

Query Match 26.3%; Score 511; DB 1; Length 3 60; 

Best Local Similarity 36.9%; Pred. No. 7.7e-35; 

Matches 12 8; Conservative 66; Mismatches 13 9; Indels 14; Gaps 8; 

Qy 10 FWSPKDAR-EFWKCMQTVGTSPDHAGQLADLLLDADLVGHYSHGLNRLHIYVDD-VKN 67 

III hh : :| I = I. I h III I Mh II Ml 

Db 11 YIRVPKDRLFSFIVRVLTKLGVPEEDAKIVADNLVMADLRGVESHGVQRLKRYVDGIISG 70 

Qy 68 GVKGNGVPKVLKQKGGTAWVDGENLLGAVVGNFCTDLAIKLAKEFGVAWVVTKNSNHYGA 12 7 

II : :|::: I :|h II III I I I I Ih h h MMIM 

Db 71 GVNLHPKIRVIREGPSYALIDGDEGLGQWGYRSMKLAIKKAKDTGIGIVIARNSNHYGI 130 

Qy 128 CQHYTKKIANAGMVGMSFTNTSPLMFPCRSSEIGLGTNPLSCCVNSEKTGDSFLLDMATT 187 

I I Ihhl Ih Ih I I Mill- llllllh 

Db 131 AGYYALMAAEEGMIGISMTNSRPLVAPTGGIERILGTNPIALAAPTK- -DKPFLLDMATS 18 8 

Qy 188 TVALGKVELADCRGKTQIPSTWGADSKGNPSTDTQWLHGGGLLPLGGIEE-TGSYKGTG 246 

I Mhh Ml II I s : I I : | : | : | I | | | | | I ' I I Ml I 
Db 189 WP I GKLE VYRRKGK - D I PEG WAINREGN I TTKVEE VFNGGALL PLGGFGELLGGHKG YG 24 7 

Qy 247 LSMMGELFCGILAGSSFGKNVRLWGQSHKAADNGQCFVAIDQECFAP - -GFAPRLQQFLD 304 

MM 111 = 1 := I h I I - h II I I I I - I - 

Db 24 8 LSLMVDILSGILSGGTWSKYVK- -NTSEKGSNVCHFFMVIDIEHFIPLEEFKEKISQMIE 3 05 

Qy 3 05 ETRNLKPISEEKPVLVPGDPERMNTEYSQKAGGLVYQEGQIKALEEL 351 

| :: : | : : : |: : | | | :|: MM 
Db 306 E I KS SRKHPEFERIWIHGEKGFLTMETRLKLGI PI YR KVLEEL 348 



